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1 Introduction 

Many extensions of the standard model (SM) predict new massive charged gauge bosons HHSl. 
The W' boson, for example, is a heavy partner of the SM W gauge boson that could manifest 
itself in proton-proton collisions at the CERN LHC. Searches for a high-mass W' boson reso¬ 
nance have been performed at the Tevatron fH |5] and the LHC ll6VIT^ in the lepton, diboson, 
and diquark final states. We present a search based on the W'+ —> tb (and charge conjugate) 
decay. This decay channel is of particular interest because the SM backgrounds can be greatly 
reduced compared to those for W' decays to light quarks, and some models predict a stronger 
W' coupling to third generation quarks IfT^ . The hadronic decay channel (W' —> tb —> qqbb) 
is presented in detail, along with the combination with the already published leptonic channel 
(W' —> tb —7- Ivhh) |j6l. The combination of these two channels leads to a significant increase in 
sensitivity. 

The most general, lowest dimension effective Lagrangian that describes the interaction of the 
W' boson with quarks IfT/l can be written as: 

^ + h-c- (1) 

where the parameters flq.q, and fl^q^ represent the left-handed and right-handed couplings 
of the W' boson to quarks, gw is the SM weak coupling constant, and Vq^q. is the Cabibbo- 
Kobayashi-Maskawa matrix. Lor a SM-like W' boson, aJj q, = 1, Aq q, = 0. 

Strict cross section upper limits have previously been placed in the case of low mass W' sig¬ 
nals mB- At higher masses, specifically Mw' ^ 1.3 TeV, the top quark is highly energetic. 
Because of the Lorentz boost, the angular separation between the top quark decay products (W 
boson and b quark) is small. The final state particles resulting from the hadronization of the 
b quark and the decay of the W boson into light quarks usually overlap, resulting in a single 
jet with top flavour, the "top quark jet", or t jet. Dedicated methods, applied to resolve the 
substructure of this t jet, enable background processes to be strongly suppressed. We apply b 
jet identification algorithms (b tagging) to the b jet from the W' decay in order to further reduce 
the SM background. 

We reconstruct the W' boson mass as the invariant mass of the top and bottom quarks (M*), 
and use the Mtb distribution to derive limits on the production cross section of the W' boson. 
We also obtain limits on the couplings of the W' boson to quarks, and present the W' production 
cross section limits obtained from a combination of hadronic and leptonic decay channels. The 
leptonic decay charmel alone excludes a W' boson with a mass less than 2.05 TeV l6|, which 
prior to this paper was the most restrictive limit obtained to date in the tb final state. The 
combination of the leptonic and hadronic channels makes it possible to extend this limit. 


2 The CMS detector 

The central feature of the CMS apparatus IfTSi is a superconducting solenoid of 6 m internal di¬ 
ameter, providing a magnetic field of 3.8 T. Within the field volume are a silicon pixel and strip 
tracker, a lead tungstate crystal electromagnetic calorimeter (ECAL), and a brass and scintilla¬ 
tor hadron calorimeter (HCAL), each composed of a barrel and two endcap sections. Lorward 
calorimeters extend the pseudorapidity coverage provided by the barrel and endcap detectors. 
Muons are measured in gas-ionization detectors embedded in the steel flux-return yoke outside 
the solenoid. 
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3 Event reconstruction 


The silicon tracker detects charged particles within the pseudorapidity range \ri\ < 2.5. It 
consists of 1440 silicon pixel and 15148 silicon strip detector modules and is located in the 
field of the superconducting solenoid. For non-isolated particles with 1 < px < 10 GeV and 
Ip I < 1.4, the track resolutions are typically 1.5% in px and 25-90 (45-150) pm in the transverse 
(longitudinal) impact parameter |[19ll . Non-isolated particle tracks are of particular importance 
to this analysis, as they are typically the constituents of jets, and are found within the detector 
barrel acceptance with an efficiency larger than 90%. 

In the region |p| < 1.74, the HCAL towers have widths of 0.087 in pseudorapidity and 0.087 
radians in azimuth {(p). In the r]-(p plane, and for |p| < 1.48, the HCAL towers map on to 5 x 5 
ECAL crystal arrays to form calorimeter towers projecting radially outwards from close to the 
nominal interaction point. At larger values of |p|, the size of the ECAL and HCAL towers 
increases and the matching ECAL arrays contain fewer crystals. Within each tower, the energy 
deposits in ECAL and HCAL towers are summed to define the calorimeter tower energies, 
which are subsequently used to provide the energies and directions of hadronic jets. 

A more detailed description of the CMS detector, together with a definition of the coordinate 
system used and the relevant kinematic variables, can be found in Ref. IflSl . 


3 Event reconstruction 

The particle-flow (PE) event algorithm Il20ll^ reconstructs and identifies each individual par¬ 
ticle with an optimized combination of information from the various elements of the CMS 
detector. The energy of photons is directly obtained from the ECAL measurement, corrected 
for zero-suppression effects. The energy of electrons is determined from a combination of the 
electron momentum at the primary interaction vertex as determined by the tracker, the en¬ 
ergy of the corresponding ECAL cluster, and the energy sum of all bremsstrahlung photons 
spatially compatible with originating from the electron track. The momentum of muons is ob¬ 
tained from the curvature of the corresponding track as determined by the tracker and muon 
system. The energy of charged hadrons is determined from a combination of their momen¬ 
tum measured in the tracker and the matching ECAL and HCAL energy deposits, corrected for 
zero-suppression effects and for the response function of the calorimeters to hadronic showers. 
Finally, the energy of neutral hadrons is obtained from the corresponding corrected ECAL and 
HCAL energies. 

Jets are reconstructed using the Cambridge-Aachen (CA) Il22l l23ll algorithm with a distance 
parameter of 0.8 (CA8 jets) as implemented by Fastjet 3.0.4 Il24l |25l to cluster PE candidates 
into jets. This algorithm clusters constituents (the reconstructed PE candidates in each event) to 
form jets based only on the angular distance between them. The C A algorithm has been shown 
to have higher efficiency for distinguishing jet substructure Il26l than competing jet clustering 
algorithms. 

Jet momentum is determined as the vector sum of all particle momenta in the jet, and is found 
in the simulation to be equal to the true momentum at hadron level within 5% to 10% over the 
full Px spectrum and detector acceptance. 

The jet energy resolution amounts typically to 15% at 10 GeV, 8% at 100 GeV, and 4% at 1 TeV, 
to be compared to about 40%, 12%, and 5% obtained jet clustering is based on calorimeter 
information only rather than on PE candidates. 

The jet energy in simulation is corrected using measurements derived from data. The jet en¬ 
ergy corrections for the CA jets are derived from the anti-kx (AK) jet clustering algorithm l27l 
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with a distance parameter value of 0.7. The AK jet energy corrections have been shown to be 
applicable to CA jets Il28l . The uncertainty associated with this procedure is noted in Section 

lO 

We use the charged hadron subtraction method Il29l to remove charged hadrons that origi¬ 
nate from a non-leading vertex prior to the application of the jet clustering algorithm, where 
the leading vertex is defined as the vertex with the largest sum of pj. After this procedure, 
the neutral component of pileup (inelastic proton-proton pair interactions in the same bunch 
crossing) is subtracted using an area based method Il30l . 

3.1 Signal modeling 

The signal samples are generated at leading order with the CompHEP Il3n package and then 
scaled to next-to-leading order using a factor of 1.2 IflTl . We generate signal samples using 
three coupling hypotheses (see Eq. Q): 

• Wjj — purely right-handed W' boson where flq;q.=0 and flq.q,=l; 

• Wl — purely left-handed W' boson where flq,q^=l and fl^.q.=0; 

• W^R — mixed-coupling W' boson where flq;q^=l and 

The Wjj width varies from 44 to 91 GeV for the mass range considered in this analysis. The 
generation of the left-handed and mixed-coupling samples takes into account the interference 
with the SM s-channel single top quark production. 

3.2 Combined CMS t-tagging algorithm 

When the W boson decays to hadrons, the top quark can be detected as three jets. The high 
boost of the top quark from a W' boson decay causes the three jets to merge into one large 
jet with a distinct substructure. The CMS t-tagging algorithm II32I discriminates signal from 
background by using this characteristic substructure. The algorithm reclusters the CA jet until 
it finds an}rwhere from 1 to 4 subjets In this process, particles with low px or at a large angular 
distance from the jet centre are omitted. The t-tagging algorithm is based on the following 
selection: 

• Jet mass 140 < Mjet < 250 GeV — The mass of the CA jet is required to be consistent 
with the top quark mass. 

• Number of subjets Ngubjets >2 — The number of subjets found by the algorithm 
must be at least 3. 

• Minimum pairwise mass Mmin > 50 GeV — The three highest px subjets are taken 
pairwise, and the pair with the lowest invariant mass is calculated (Mmin)- The value 
of Mmin is required to be greater than 50 GeV for consistency with the W boson mass. 

In addition to these requirements, the N-subjettiness algorithm is used for t-jet identification 
l33l . This algorithm defines the Tn variables, which describe the consistency between the jet 
energy and the number of assumed subjets, N: 

\ 

Tn = min{ ARiq', AR 2 ,u ..., ARnj}, 


( 2 ) 
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3 Event reconstruction 


where ARj ,■ is the angular distance (AR = V {Arj)^ + (Acp)^) measured between the subjet can¬ 
didate (J) axis and a specific constituent particle (i), and d is the normalization factor: 

d = EP^.R, (3) 

i 

where R is the characteristic distance parameter used by the jet clustering algorithm. A jet with 
energy consistent with N subjets will typically have a low tn variable. A t jet should be more 
consistent with three subjets than two (when compared to jets originating from a gluon or a 
light quark), therefore the ratio of T 3 and T 2 allows top jets to be distinguished from multijet 
events from QCD processes (labeled in the following as QCD background). We select events 
with T 3 /T 2 < 0.55. 

We apply the combined secondary vertex (CSV) b-tagging algorithm to all of the subjets found 
by the t-tagging algorithm. We require the maximum CSV discriminator value to pass CSV b 
tagging at the medium operating point (SJ^gvMAX ^ 0.679) 1341 . The CMS t tagger with the 
addition of N-subjettiness and subjet b tagging is referred to as the combined CMS t tagger. 

Substructure variables in the signal region exhibit known differences between data and simula¬ 
tion that affect the t-tagging efficiency. We derive a scale factor that is the ratio of the t-tagging 
efficiency measured in data to simulation l32l , using hadronic top quark decays from a control 
region that consists of an almost pure sample of semileptonic tf events. This ratio, which is 
measured to be 1.04 ± 0.13, is applied as a correction factor to the signal samples that are used 
in this analysis. 

3.3 Identification of b jets 

To identify the b quark daughter of the W' boson, we start with the b-candidate jet, which 
is the highest px jet that is hemispherically separated from the top-tagged jet. We apply the 
CSV algorithm used to identify b jets to this b-candidate jet. The medium operating point is 
used, which has a light-flavour jet misidentification probability of 1 % for an efficiency around 
70%. A scale factor is applied to correct for differences in b-tagging efficiency between data and 
simulation iMl . The uncertainties in this scale factor are described in Section [ 6 T] Backgrounds 
from SM tf production are reduced by requiring the invariant mass of the b-candidate CA jet 
to be below 70 GeV. 

3.4 Reconstruction of W' mass 

We select candidate W' —> tb events by using the following criteria, which are applied to the 
two leading jets: 

• One jet with pj > 450 GeV identified with the combined GMS t-tagging algorithm; 

• One jet with pj > 370 GeV with a GSV b tag at the medium operating point and 
mass <70 GeV; 

• The two jets are in opposite hemispheres (I A(^ I > n/2); 

• The difference in rapidity between the two jets (| Ay |) is less than 1. 6 . 

The number of events after each successive selection remaining in data, SM tf production, s- 
channel single top, and right-handed W' boson signal simulations is shown in Table 
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4 Event samples 

The data used for this analysis correspond to an integrated luminosity of 19.7fb^^ of pp colli¬ 
sions provided by the LHC at a centre-of-mass energy of 8 TeV. We select events online using 
a trigger algorithm that requires the scalar pj sum of reconstructed jets in the detector to be 
>750 GeV. The trigger is nearly 100% efficient for events selected in the offline analysis. The 
small trigger inefficiency is measured from data and applied to the simulated events. 

Table 1: Numbers of observed and expected events at successive stages of the event selection. 
The expected numbers are scaled to an integrated luminosity of 19.7fb^^. Statistical uncertain¬ 
ties in the event yields are quoted. The QCD background contribution is only reported for the 
final selection. The first row implies a 150 GeV pj selection on the jets, the trigger selection, and 
the requirement that the two leading jets be in opposite hemispheres. The row labeled "pj" 
represents the transverse momentum requirements placed on the two leading jets. The signal 
events, shown for several values of the W' boson mass, are normalized to the theoretical cross 
section. The s-channel single top contribution (STg) is given as well as it is used when deriving 
the shape of the signal distribution when calculating the generalized coupling limits. 


Selection 

Data 

QCD 

tt 

SL 

Mw;, 

1.90 TeV 

Mw;, 

2.10 TeV 

Mw' 

1.90 TeV 

Mw' 

2.10 TeV 

2 jets 

13854873 

— 

12190 ±27 

283 ±8.6 

806 ±1 

401 ±0.7 

796 ±2 

430 ±2 

Pt 

4305244 

— 

4720 ±18 

130 ±6.5 

739 ±1 

372 ± 0.7 

703 ±2 

364 ±2 

Dy| 

3376771 

— 

4220 ± 17 

121 ± 6.3 

553 ±1 

268 ±0.6 

531 ±2 

268 ±1 

Mt 

992949 

— 

3220 ± 14 

64 ± 4.5 

429 ±1 

209 ±0.5 

414 ±2 

205 ±1 

I'^subjets 

557489 

— 

2740 ±13 

48 ±3.9 

340 ±0.9 

163 ± 0.5 

312 ±2 

152 ±1 

^min 

318520 

— 

2510 ± 13 

42 ±3.7 

304 ±0.9 

143 ± 0.4 

274 ±2 

130 ±0.9 

SfcSVMAX 

50642 

— 

1690 ± 10 

23 ±2.6 

170 ±0.6 

76 ±0.3 

138 ±1 

63 ±0.6 

r3/T2 

7200 

— 

1024 ± 8 

11 ±1.8 

88 ±0.5 

38 ±0.2 

58 ±0.7 

27 ±0.4 

Mb 

4463 

— 

178 ±4 

8±1.6 

68 ±0.4 

29 ± 0.2 

44 ± 0.6 

20 ±0.3 

CSV 

277 

248 ±4 

37±1 

2 ±0.71 

16 ±0.2 

6 ±0.1 

10 ±0.3 

4 ±0.2 


5 Background modeling 

The primary sources of background are SM QGD multijet and tt production. These back¬ 
grounds dominate because of QGD background events after selecting an all-jet final state is 
selected and the large contribution from tf production that remains after t-jet discrimination 
criteria are applied. 

The principal background in this analysis is QGD multijet production, and is estimated using a 
data-driven technique to extract both the shape and the normalization. The average b-tagging 
rate, measured from events with an enhanced QGD multijet component and a negligible sig¬ 
nal contamination component, is used to estimate the QGD multijet contribution in the signal 
region. We account for the background contribution from SM tt production when measuring 
the b-tagging rate. In addition to the region used to extract the average b-tagging rate, we 
define two test regions to check the QGD background prediction, both of which have small tt 
background and possible signal contamination. 

The shape of the Mtb distribution for tt production is estimated from Monte Garlo (MG) simu¬ 
lation, and the yield is measured from data using a control sample enriched in tt events. 
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5 Background modeling 


5.1 QCD background estimate 

A control sample is obtained by inverting the Ngubjets selection criteria used to identify t jets: 


140 < Mjet < 250 GeV; 

(4) 

^subjets — 7, 

(5) 

SJcsvMAX > 0.679. 

(6) 


We apply the b tagging criteria to the b-candidate jet in the event to measure the average b- 
tagging rate for QCD jets. We assume that this rate is the same for QCD jets in the signal region. 
We include the subjet CSV discriminant in this control sample in order to ensure similar parton 
flavour distributions in the signal and control regions. Because of the similar parton flavour 
distributions, and the fact that this region has a low tf and signal contamination component, 
this control sample is an ideal selection to extract the average b-tagging rate. The average b- 
tagging rate is parameterized as a function of the pj of the b-candidate jets (which pass all 
requirements except the b tag) in three \r]\ regions: 

• Low (0.0 < |j/| < 0.5); 

• Transition (0.5 < |?/| < 1.15); 

• High (1.15 <\r]\< 2.4). 

Events in the signal region that do not have b tagging applied are then weighted by this average 
b-tagging rate to estimate the QCD background contribution in the final selection. Events in 
the signal region before b tagging is applied are largely from QCD background, but the small 
tf background component (less than 1%) is subtracted when deriving the QCD background 
contribution to avoid double counting. 

We use a bifurcated polynomial to fit the average b-tagging rate in each of these |?/1 ranges. This 
fitting function, which provides a satisfactory description of the data, is defined as follows: 


/(^) 


po-\-pix + p 2 {x — a)^, iix<a 
po + pix + p 3 {x - a)^, ifx>a 


(7) 


Here, the parameters po to ps are the polynomial coefficients, and z is the pj of the b-candidate 
jet. The parameter a is the bifurcation point, and is optimized for each region in |p|. It is 
chosen to be 500, 500, and 550 GeV for the low, transition, and high | rj \ regions, respectively. 
The parameterization of the average b-tagging rate helps to constrain the known pj and 1 1 ] | 
kinematic correlation inherent in b tagging, which is due to detector geometry and tracking 
resolution. 


The uncertainty in the average b-tagging rate is extracted using the full covariance matrix ob¬ 
tained from the output of the fitting algorithm. Additionally, we assign a systematic uncer¬ 
tainty to cover the choice of the fit function (see Section 6.11 based on several alternative func¬ 
tional forms (such as second degree polynomial or exponential functions). Eig. shows the 
measured average b-tagging rate and associated uncertainty bands. 


5.2 The tt background estimate 

We obtain the normalization of the background contribution from SM tt production using a 
control sample in data. A sideband region is defined (CR3) by inverting the b-candidate mass 
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P, (GeV) 


19.7fb'^ (8 TeV) 



P, (GeV) 


Figure 1: The average b-tagging rate for QCD jets parameterized as a function of the pj of 
the b-jet candidate from the low (top), transition (middle), and high (bottom) \t]\ regions. The 
measured average b-tagging rate is represented by the data points, the polynomial fit is shown 
as a solid line, and the propagated uncertainties from the fit are shown as the dashed lines. The 
horizontal lines on the data points indicate the bin widths. 
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5 Background modeling 


requirement (see Section |3.3| in the signal region. This region has an enhanced fraction of tt 
events and is statistically independent from all other sidebands in the analysis. 

We compare the sum of the QCD background estimate from data and the SM tt contribution 
obtained from MC simulation to the observed yield in data. The tf and single top quark pro¬ 
duction samples used in this analysis are generated using the POWHEG Il35ti37l 1.0 event gen¬ 
erator and are normalized to the SM expectation using next-to-next-to leading-order cross sec¬ 
tions Il38ll39| . 


For the QCD background determination in this sideband region, we use the procedure outlined 
in Section 5.1 However, we invert the b-candidate mass requirement when extracting the 


average b-tagging rate in order to account for potential correlations between the b-candidate 
mass and the b-tagging rate. 


We perform a binned maximum likelihood fit to the invariant mass distribution of the b- 
candidate jets, using the shape of the tf background MC prediction as one template, and the 
QCD background prediction from data as the other. The normalization of the QCD background 
template is allowed to vary within its systematic uncertainty envelope, whereas the normaliza¬ 
tion of the tf template is left unconstrained. The result of the fit is shown in Fig.|^ 


19.7 (8 TeV) 



200 250 


300 350 

Mje, (GeV) 


Figure 2: Distribution of the mass of the b-jet candidate after the template fit to constrain the 
QCD multijet and tf backgrounds has been applied. A control region with inverted selection 
criteria on the mass of the b-jet candidate is used for the fit, and is shown as the area to the 
right of the vertical dashed line. The hatched region represents the background uncertainty, 
obtained by adding the QCD multijet uncertainty and the uncertainty on the tf normalization 
from the output of the fit in quadrature. The bottom plot shows the pull ((data-background)/ a) 
between the data and the background estimate distributions. 


The contamination from tf events must be taken into account when obtaining the QCD back¬ 
ground template in this sideband from data. The fit described above independently varies the 
QCD and tf templates, however the component of the QCD background from tf events intro¬ 
duces a small anticorrelation between the two templates. We account for this by first fitting 
the QCD background before the tf component is subtracted and correcting the resulting tf nor¬ 
malization by the factor 1+C/S, where C/S is the ratio of the nominal number of tf events 
in the control region (C) to that in the signal region (S). After applying this correction, the tf 
normalization obtained from the fit is independent of the fraction subtracted from the QCD 
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background template. Following this procedure we find that the SM tt production rate in the 
signal region must be scaled by a factor of 1.23 it 0.24. 

In order to correct for known differences in the top quark px spectrum between data and MC 
simulation of SM tf production |40| , we reweight the MC events using the generator level px of 
the top quark and top anti-quark. Although this procedure was not designed for the kinematic 
range in our analysis, we still use it as the change to the normalization of the tf background 
template is consistent with our measurement of this extra normalization factor. 


5.3 Control region closure test 


To investigate the applicability of the QCD background estimation in data, we apply the aver¬ 
age b-tagging rate to control regions of the t-tagging selection defined in Section 3.2 First, we 
define a control region with inverted minimum pairwise mass and N-subjettmess selections 
(CRl). This region is orthogonal to both the signal region and the control region that is used 
for the determination of the average b-tagging rate. The selection also has a very low yield of 
tf production, which allows for a precise measurement of the QCD background contribution. 
The average b-tagging rate used for this closure test is extracted from the same control region 
as the signal region, and is applied to events that are not b tagged. This test shows good agree¬ 
ment as shown in Fig. |^(top). Additionally, we define a control region with an inverted subjet 
b-tagging selection (CR2). This test also shows good agreement as seen in Fig. [^(bottom). 


These control regions have low signal contamination. For the 1.90 TeV sample, the signal 

contamination is less than 1%. These control regions are summarized in Table 


Table 2: Numbers of observed and expected events for each of the sidebands used for closure 
tests of the QCD and tf background estimates. A summary of the inverted selection for each 
control region is provided. 


Control Region 

Data 

QCD 

tf 

CRl: Mmin < SOGeV; T 3 /T 2 > 0.55 

1100 

1107 

15 

CR2: SJcsvMAX < 0.679 

1376 

1466 

8 

CR3: Mb > 70GeV 

336 

121 

200 


6 Results 

After constraining the SM tf normalization using a particular control region (see Section |5.2| |, 
and investigating the agreement of the data and the QCD background estimate in two addi¬ 
tional control regions (see Section [53] l, the background estimate is used to predict the data dis¬ 
tribution in the signal region, in the absence of a W' boson contribution. The results are shown 
in Fig. 1^ Good agreement between data and expectation from SM processes are observed, with 
no signs of a new physics signal. 

6.1 Systematic uncertainties 

We consider several sources of systematic uncertainty, corresponding to uncertainties in both 
the shape and normalization of the M* distribution, which are summarized in Table 2. 

A 19% normalization uncertainty is assigned to the estimate of the SM tf normalization (see 
Section [ 5 !^ . The scale factor used to account for differences in the t-tagging efficiency between 
MC and data has an uncertainty of 13% |32]| . The uncertainty in the integrated luminosity is 
2.6% |4l|. Finally, the MC to data scale factor for the b-tagging efficiency is evaluated using AK 








10 


6 Results 


19.7fb"' (8 TeV) 




Figure 3: Distributions of Mtb shown for data, tf, and QCD in the control regions CRl (top) and 
CR2 (bottom) as defined in the text. The hatched region shows the background uncertainty, ob¬ 
tained by adding the statistical and systematic uncertainties in quadrature. The bottom plots 
show the pull ((data-background)/ a) between the data and the background estimate distribu¬ 
tions. 
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jets but is applied to CA jets, which requires a systematic uncertainty of 2% in addition to the 
b-tagging scale factor uncertainty of around 8%. 

Several uncertainty sources contribute to the estimation of the average b-tagging rate, which 
is used to evaluate the QCD background prediction. We include a contribution due to the 
uncertainties obtained from the polynomial fit used to parameterize the b-tagging rate, and 
investigate the effect of choosing alternative functional forms in the fit. We also obtain an 
estimate of the uncertainty caused by the specific parameterization of the average b-tagging 
rate by studying an alternative parameterization using pj, rj, and M*- 

We derive an uncertainty in the effect of pj reweighting on the shape of the tt background by 
taking the unweighted Mtb distribution as the + 1(1 shape. 

The uncertainty arising from the variation of the renormalization and factorization scales in tf 
production is evaluated from MC samples generated with two times (-|-lcr) and one half (—Icr) 
the nominal renormalization and factorization scales. 

The b-tagging scale factor ±.\a values are applied to tt production and signal MC Il34ll . The 
nominal jet energy corrections created for use with AK jets are applied in the analysis. The ±lcr 
uncertainties in jet energy scale and resolution arising from the application of AK jet energy 
scale corrections are also considered. Additionally, a 3% uncertainty is applied to the jet energy 
scale to account for differences in CA and AK jets. Finally, the uncertainty in the trigger turn-on 
efficiency that is applied to the MC samples is taken as one half of the trigger inefficiency 

Uncertainties arising from parton distribution functions are studied by varying the eigenval¬ 
ues of the parton distribution functions that are used in the simulation. Pileup modeling in 
simulation is corrected by comparing the number of pileup interactions to the mean number 
of interactions in data. The uncertainty on this correction is studied by varying the minimum 
bias cross section. These sources, along with the jet angular resolution variation, are negligible. 

The two largest systematic effects arise from the uncertainties in the average b-tagging rate for 
the QCD background (approximately a 6% normalization effect) and in the top tagging scale 
factor for the signal (13%). 

Table 3: Sources of uncertainty that affect the Mtb distribution and the ±1(7 variations that are 
used in the fit. 


Source 

Variation 

tf normalization 

19% 

t-taggmg scale factor 

13% 

Integrated luminosity 

2 .6% 

anti-kx to CAS jets b-tagging scale factor 

2 % 

Average b-tagging rate fit 

±1(7(ptM) 

Alternate functional forms for the average b-tagging rate 

±la{pj,rj) 

Parameterization choice for the average b-tagging rate 

±la{pj,ri,Mth) 

Pj reweighting 

Shape ±1(7 (ptj, pj-) 

Renormalization and factorization scales 

2Q^ and 0.5Q^ 

b-tagging scale factor 

±1(7(pt) 

Jet energy scale 

±la{pj,r]) 

Jet energy resolution 
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6 Results 


19.7 fb'^ (8 TeV) 



Figure 4: The distribution of M* shown for data, tt, QCD, single top, and several example 
signal W' boson mass hypotheses. The normalization for the W' signal samples assumes the 
cross section from theory. The distributions are shown after the application of all selection 
criteria. The background contribution from single top quark production is not considered when 
setting limits. The hatched region shows the background uncertainty, obtained by adding the 
statistical and systematic uncertainties in quadrature. The bottom plot shows the pull ((data- 
background)/ a) between the data and the background estimate distributions. 
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6.2 Cross section limits 

To set limits on the production cross section of the boson model described in Eq. Q, we 
compare, for each bin in the M* distribution, the numbers of observed and expected events. 
The small background contribution from single top quark production is not considered when 
setting limits. The following expression is used to compute the expected contribution from 
boson production: 


^expected ^W[j^tb;W^’hadrons ^ J ^ 

where is the cross section, <Bw^^tb;W^hadrons is the branching fraction ^ tb with 
the W boson decay constrained to the hadronic branching fraction, e is the signal efficiency, and 
J L df is the integrated luminosity of the data set. We perform a binned maximum likelihood 
fit to compare the Mtb distribution from data with the boson signal hypothesis, summed to¬ 
gether with the SM distribution obtained from the background estimation procedure described 
in Section]^ 

A Poisson model is used for each bin of the M* distribution. The mean of the Poisson distri¬ 
bution for each bin is taken to be: 


m = (9) 

k 

where k includes both the signal and background models, is the Poisson mean for process k, 
and represents the fraction of events expected for each process k in bin i. 

The likelihood function is then: 


^bins 

m) = n 


Nf 


(jydata^j 


( 10 ) 


where is the number of events in data for bin i. 

Using a Bayesian approach with a flat prior for the signal cross section, we obtain 95% confi¬ 
dence level (CL) upper limits on the production cross section of W^. Pseudo-experiments are 
used to derive the ±1(7 deviations in the expected limit. The systematic uncertainties described 
above are accounted for by nuisance parameters and the posterior probability is refitted for 
each pseudo-experiment. The cross section upper limits are shown in Fig. We exclude a W^ 
boson with a mass less than 2.02 TeV at 95% CL . 

We combine the results from the hadronic and leptonic W' decay modes to enhance the sensi¬ 
tivity of the analysis to W' production and the measurement of the coupling strengths of the 
W' boson to quarks. The analysis of the leptonic channel excludes a W' mass below 2.05 TeV, 
and is described in Ref. |l6i|. The all hadronic and leptonic channels have similar sensitivity in 
the high W' mass regime, which leads to a large increase in sensitivity in the combined result. 

The hadronic channel probes W' signal generated from a mass of 1.3 to 3.1 TeV because of the 
sensitivity of the boosted top jet tagging techniques, whereas the leptonic channel probes W' 
masses from 0.8 to 3.0 TeV because of the higher sensitivity at low W' mass. Therefore, the 
region of combined sensitivity ranges from a W' mass of 1.3 to 3.0 TeV. Below this region, only 
the leptonic channel limits are quoted. 
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6 Results 


There are points within the region of combined sensitivity where the signal sample exists for 
the leptonic channel but not for the hadronic channel. These intermediate mass points are re¬ 
produced using ROOFit i42l template morphing to interpolate the shape of the M* spectrum. 
The generator level selection on the pj of the b quark for the left-handed and mixed-coupling 
W' samples is taken into account by interpolating the selection efficiency for the intermediate 
mass points. 

We assume that the uncertainties in jet energy scale, jet energy resolution, b-tagging scale fac¬ 
tors, and the total integrated luminosity are correlated between the two samples. All other 
systematic uncertainties are assumed to be uncorrelated. These include the renormalization 
and factorization scale and pj reweighting uncertainties, since different generators are used 
for the simulation of tt events and the hadronic channel extracts the tf normalization from 
data. The method for setting combined cross section upper limits is identical to the limit set¬ 
ting procedure of the all hadronic channel, except a joint likelihood is used that considers both 
channels. 

The combined cross section upper limits are shown in Fig. A boson with a mass 
below 2.15 TeV is excluded at 95% CL. 
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Figure 5: The boson 95% CL production cross section times Byji limits for the hadronic 
channel. The observed (solid) and expected (dashed) limits, as well as the boson theoretical 
cross section (dot-dashed) are shown. As indicated in the legend, the shaded regions about the 
expected limits represent 1 and 2a bands. 


6.3 Generalized coupling limits 

To set limits on generic couplings, we use the procedure outlined in Ref. O. Because the 
left-handed and mixed-coupling samples cannot be separated from SM single top quark pro¬ 
duction, we set limits on the couplings a^ and a*^. The SM single top quark production is 
negligible when setting cross section limits. As it is used for signal modeling in the generalized 
coupling analysis, it needs to be included in that calculation. For limit setting, we reweight the 
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Figure 6: The boson 95% CL production cross section times liinits for the combined 

hadronic and leptonic channels. The observed (solid) and expecteii (dashed) limits, as well as 
the boson theoretical cross section (dash-dotted) are shown. As indicated in the legend, the 
shaded regions about the expected limits represent 1 and 2cr bands. The region to the left of the 
vertical dashed line shows the limits derived only from the leptonic channel. The right of the 
vertical dashed line shows limits based on the combined hadronic and leptonic channels. 


signal templates (single top quark, W^, W^, W^r) to form a combined signal template with the 
following cross section: 


"ud'“ud'®tb'"tb 


1 _ -L ^ _ I -R -R 


«ud«tb+«ud4 


R 


L ^ud^tb '^ud'^* ^ 9 '^ud'^ft^ud'^tb 


+ «kd«tb L L , R R 
“ud“tb ^ “ud“tb 


«ud4+« 


R „R 


ud'^tb 


( 11 ) 


where a^i^, are the cross section for right-handed, left-handed, or mixed samples, 

respectively and is the SM s-channel single top quark production cross section. We assume 
that the left-handed and right-handed coupling constants are equal for first and third genera¬ 
tion quarks (fl^^ = fl[j, and = fl|,). 

The templates are then summed and cross section upper limits are set using the resultant yield 
as the signal process for the given values of and a^. Limits are calculated using pairwise 
combinations of the couplings from 0 to 1 in increments of 0.1. 

Using these cross section upper limits, we obtain the values where the Mw' cross section limits 
are equal to the theoretical cross section prediction for a given combination of and a^. These 
points are the maximum excluded W' boson mass for given values of and Exclusion 
limits when considering generalized couplings can therefore be represented with contours on 
a two dimensional plot in and and the contour is the maximum excluded W' boson mass. 
These contours in the (fl^, fl^) plane are shown in Fig. for both observed and expected limits 
in the hadronic channel. The mass upper limits for left-handed and mixed-coupling W' bosons 
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7 Summary 


are 1.92 and 2.15 TeV, respectively. 

For this procedure, no systematic uncertainty is considered from single top quark production 
since the templates are dominated by statistical uncertainties. 

Additionally, we present combined limits on the W' coupling strengths, and The limits 
for the leptonic channel that are used for the combination are shown in Fig. These are up¬ 
dated from Ref. |6J to include the effect of the W' width on the cross section of an and 
combination. The limits for the combined hadronic and leptonic channels are shown in Fig. 

7 Summary 

A search for a new massive gauge boson W' decaying to a top and a bottom quark with a 
hadronic signature has been performed using proton-proton collisions recorded by the CMS 
detector at \/s = 8 TeV, corresponding to an integrated luminosity of 19.7fb^^ The search is 
focused on W' masses above 1.30 TeV, where the main feature of this topology is a top quark 
whose decay products merge into a single jet. 

The analysis uses jet substructure algorithms to distinguish the top quark jet from standard 
model hadronic jet backgrounds. The principal background, from QCD multijet production, 
is estimated from data using the average b-tagging rate measured in a QCD enhanced con¬ 
trol region. The other important source of background, from standard model tf production, is 
estimated from simulation, and is corrected by a scale factor derived from control samples in 
data. 

Limits are placed on the production cross section of a right-handed W' boson, together with 
constraints on the left-handed and right-handed couplings of the W' boson to quarks. The 
production of a right-handed W' boson with a mass below 2.02 TeV decaying to a hadronic 
final state is excluded at 95% confidence level. The lower limit on the mass of the right-handed 
W' boson increases to 2.15 TeV at 95% confidence level when both hadronic and leptonic decays 
are considered, and is the most stringent lower mass limit to date in the tb decay mode. 
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Figure 7: Contour plots of Mw' in the (a^, plane in the hadronic channel. The top (bottom) 
plot shows observed (expected) limits. The contour shading indicates the values of Mw' where 
the theoretical cross section is equal to the observed or expected 95% CL limit. 










18 


7 Summary 





Figure 8: Contour plots of Mw' in the (a^, plane in the leptonic channel. The top (bottom) 
plot shows observed (expected) limits. The contour shading indicates the values of Mw' where 
the theoretical cross section is equal to the observed or expected 95% CL limit. These are up¬ 
dated from Ref. |6J to include the effect of the W' width on the cross section of an and 
combination. 
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Figure 9: Contour plots of Mw' in the (a^, plane using the combined hadronic and leptonic 
channels. The top (bottom) plot shows observed (expected) limits. The contour shading indi¬ 
cates the values of Mw' where the theoretical cross section intersects the observed or expected 
limit band. The solid contours represent combined limits, and the dashed contours indicate 
limits that are obtained purely from the leptonic channel. 
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